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Fractals with Positive Length and 
Zero Buffon Needle Probability 

Yuval Peres, Karoly Simon, and Boris Solomyak 


1. INTRODUCTION. The classical definition of the length of a curve, based on 
polygonal approximation, is not useful for measuring the “length” of more complicated 
sets. For a Borel (e.g., closed or open) subset F of R, the Lebesgue measure |F| of F, 
which is defined by 


\F\ = inf 


Ei / 'i :Fc U / ' 


(i) 


using countable covers by intervals, works well (here |/| signifies the length of an 
interval /). This definition was extended by Caratheodory [5] as follows. 


Definition 1. For a Borel set F in R d let 


H l (F) = liminf |^diam(F i ) : F C , diam(Z?;) < e 


where the infimum is extended over all countable covers of F by closed sets of 
diameter at most s. 


It turns out that TL X is a measure, now called one-dimensional Hausdorff measure 
because it was generalized by Hausdorff [10] to the whole family of measures H a , 
where a is any positive number (integer or noninteger). The modem theory of “frac- 
tals” is largely based on the notion of the Hausdorff dimension dim# (F) of a set F, 
defined by dim//(F) = inf {a > 0 : 7i a (F) = 0}. We recommend the book by Fal- 
coner [8] for an introduction to fractals. 

Here we consider only a = 1, since we are interested in the generalizations of 
length. If F is a rectifiable curve, then H x (F) is exactly its length; therefore in modem 
analysis, it is standard to refer to H l (F) for any compact set F as the “length of F .” 

Another way to measure length goes back even further, to the eighteenth century. 
In 1733 Georges-Louis Leclerc, the “Comte de Buffon,” posed the following problem, 
which became known as “Buffon’s needle problem”: Given a collection of parallel 
lines in the plane , with distance d between adjacent lines , determine the probability 
that a needle of length l < d will cross one of these lines when dropped at random 
on the plane. The answer 21/ (ltd) was given by Buffon himself in 1777 and can be 
found in many probability texts (for example, in [9]). It follows from Buffon’s formula 
that if a polygon of perimeter p and diameter less than d is dropped on the same 
plane, then the expected number of points at which it will cross one of the parallel 
lines is 2p/(jtd). This idea was formalized and extended in 1868 by Crofton (see [6] 
and [19]), as follows. Count the number of intersections of a given set F with a straight 
line, and then integrate this number over the space of all lines. The result is denoted 
by I\(F) and called the integral- geometric measure of F. More precisely, let Tl e be 
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the orthogonal projection onto the line L e passing through the origin that makes an 
angle 6 with the horizontal. Then 


h(F) = 



card(F n W e l (y)) dy d6. 


When F is a rectifiable curve, 7j (F) is twice its length (see [18]), but for other compact 
sets in the plane I\{F) and 2H l (F) can differ sharply; in the next section we provide 
examples for which Tt l (F) > 0, whereas I\(F) — 0. 

It is easy to see that f (F) = 0 if and only if | I\ e (F) \ = 0 for almost all 0 in [0, 7r). 
Another notion of “length” in the Euclidean plane, called Favard length (see [3]), is 
defined for a Borel set F by 


Fav(F) = f \Tl 0 (F)\dO. 

Jo 

Thus Fav(F) = 0 if and only if I\(F) = 0. Consequently, to say that a set has “zero 
Buffon needle probability” is another way of saying that almost every projection of the 
set onto a line has zero Lebesgue measure. 

2. SELF-SIMILAR FRACTALS IN THE PLANE AND THEIR PROJEC- 
TIONS. There is no universally agreed upon definition of a “fractal,” but “we know 
one when we see one.” The sets that we discuss in this article are certainly fractals, 
but they would be excluded if we used the naive definition of a fractal as a set having 
nonintegral Hausdorff dimension (our sets in the Euclidean plane have dimension 
one). When we say “fractal” we usually have several properties in mind, such as 
“fine structure on arbitrarily small scales” and some form of “self-similarity” (see [7] 
or [8]). The latter term will be used here in the following precise sense: a compact set 
in Euclidean space is said to be self-similar if it can be represented as a finite union 
of (not necessarily disjoint) sets, each of which can be obtained from the whole set by 
scaling (with a factor less than one), followed by an isometry. 

The best known fractal is probably the Cantor middle-third set. We consider a vari- 
ant of the Cantor set construction where at each stage we remove middle halves of 
intervals rather than their middle thirds. 

Start with the unit interval F 0 = [0, 1]. Remove the (open) middle half — resulting 
in F\ = [0, 1/4] U [3/4, 1]. Then repeat the process removing the middle half of each 
of the intervals that remain (see Figure 1). At stage n we get a set F n that is the 

union of 2 n intervals of length A~ n . These sets are nested: F 0 D F\ D F 2 D Their 

intersection 


Figure 1 . Construction of the Cantor middle-half set. 
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00 


*=n f " 

n= 0 


(nonempty by the nested intersection theorem) is called the Cantor middle-half 
set. 

Observe that \F n \ — (l/2) n — ► 0, so \K\ = 0. Thus, the length measure is not very 
useful when applied to the set K. (It turns out that the ‘natural’ measure on K is the 
1 /2-dimensional Hausdorff measure, but we do not discuss this here.) 

Notice also that the Cantor middle-half set is self-similar in the sense defined earlier. 
Indeed, the recursive nature of the construction allows us to write 

U (I* + §) . (2) 

Iterating formula (2) leads to a more explicit expression for the Cantor middle-half 
set in terms of expansions in base 4. In fact, K is the set of all numbers in [0, 1] 
admitting base 4 expansions in which the only allowable digits are 0 and 3 (or 0 and 
3/4, depending on whether we elect to index the series that gives the expansion starting 
with n = 1 or n = 0). Formally, 


K = \f^a n 4- n :a n e{0,3} 

I n = 1 


: a n e {0, 3/4} 


n = 0 


( 3 ) 


More generally, a self-similar set K in the real line can be defined by a set equation 


K = {J(rK+d i ), (4) 

1 = 1 


where r in (0, 1) is called the contraction ratio and the m numbers d \ , . . . , d m are said 
to be admissible digits. A nonempty compact subset of R satisfying (4) is unique [8, 
p.l 14] and can be expressed as follows: 


K = 


00 


J2 a * r> 


. a n G {d \ , ^ 2 ? • • • 9 d m j / . 


( 5 ) 


(It seems natural to assume that the digits d t are all distinct. However, if we have a 
family of self-similar sets depending on parameters, then for some parameter values 
two or more digits may coincide, so we do not preclude such a possibility.) In fact, 
we do not have to restrict ourselves to the real line — everything works the same way 
in the plane and in higher dimensions. The only change that needs to be made is to 
allow the digits in (4) and (5) to be vectors. Of particular interest to us will be the set 
K 2 = K x K, where K is the Cantor middle-half set. It can be described directly by a 
Cantor-like construction in which we start with the unit square [0, l] 2 = [0, 1] x [0, 1] 
at stage 0, and at stage n replace each of the squares from the previous stage by Tour 
comer squares of side-length 4~ n (see Figure 2). The alternative “digit” description of 
the set K 2 is: 



We remark that the “four comer” set K 2 is a union of four pieces, each of which is a 
translate of K 2 scaled by a factor of 1 /4. This self-similar structure suggests that K 2 is 
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a one-dimensional set. Indeed, one can show that 0 < H l ( K 2 ) < oo. The upper bound 
follows easily from the definition of 7i l , since K 2 is covered by 4 n squares of diameter 
\fl • 4~ n for each n. The lower bound can be derived by considering the probability 
measure /a on K 2 such that /jl(K 2 Pi Q) = 4~ n for each of the 4 n squares Q arising at 
stage n of the construction (see [8, 1.3] for a justification that such a measure exists). 
This measure satisfies < 9 diam(P) for any closed set B, since B can be covered 
by at most nine dyadic squares with side-length smaller than diam(P). Therefore, for 
any cover { B t } of K 2 by disks, we have Yi diam (B t ) > Yi At(fl/)/9 > fi{K 2 )/9 = 
1/9. For more details, we refer the reader to [8, chap. 4]. 

We would like to understand the projections of self-similar sets in the plane. Again, 
let Y\ e : R 2 — > R denote the orthogonal projection on the line Lq through the origin 
that makes an angle 9 with the horizontal. It is more convenient, however, to consider 
all projections to be subsets of R. To this end, we let 

P e = X cos 0 + y sinO (7) 

for 0 in [0, tt). Clearly, II ^ = RgPg, where R e is the rotation through angle 0 , so 
IP# (A) | = in* (A) I for any set A of the plane. If A is a self-similar set in the plane 
given by (4) or (5), but now with vector digits b,-, then P e ( A) is a self-similar set 
in R with the same contraction ratio r and with digits P*(bi), . . . , Pg( b m ). (Observe 
that some of the digits for the projection may coincide even if all the vector digits are 
distinct.) However, these projected sets are often very complicated because of possible 
overlap (see Figure 3). In general, they can no longer be described via an iterative 
process of “removals,” as was the case for the Cantor middle-half set. Figure 3 shows 
the projection of the first stage of the construction of the set K 2 . When we iterate the 
construction, the interplay between “gaps” and “overlaps” becomes quite involved. 
(Note that in Figure 3 we translated the set K 2 away from the origin in order to make 
the picture more illuminating. This just translates the projections.) 

Some projections of K 2 are easy to describe. Indeed, Po(K 2 ) = K is the Cantor 
middle-half set, hence |P 0 (^ 2 )I = 0. Also, P^^K 2 ) = K , so |P^ /2 (^ 2 )I = 0. This 
observation has a remarkable consequence, in view of the following theorem of Besi- 
covitch [2]. 

Theorem 2 (Besicovitch). Let F be a compact subset of M 2 with the property that 
0 < TC l (F) < oo. If two distinct projections of F have zero length , then almost every 
projection of F has zero length. 
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Figure 3. Projecting the “four corner” set. 


The theorem clearly applies to the set K 2 . “Almost every projection” means that 
there is a set E in [0, n) of zero Lebesgiie measure such that \Po(F)\ = 0 for all 9 not 
belonging to E. The proof of Theorem 2 is quite complicated. Our goal is to give an 
elementary proof of it for a class of self-similar sets F that includes the “four comer” 
set K 2 . One advantage of this elementary approach is that it can be refined to yield 
quantitative estimates (see section 3). 

Proposition 3. Suppose that m > 3 and that 


A = 


OO 

^ ] a n m • ^ {bl > t)2> ... ? } ■ , 

n= 0 


( 8 ) 


where bi, b 2 , . . . , b m are distinct vectors in R 2 such that the “pieces” A t = b, + 
m _1 A are pairwise disjoint. Then |P 6 »(A)| = 0 for almost every 9 in [0, n). 

One can check that for m = 2 the set A, defined as in (8), is a line segment, so all 
its projections, except one, have positive length. For m > 3 the disjointness condition 
precludes the set A from being a subset of a line; this is proved in Lemma 6. 

The proof of Proposition 3 is based on several lemmas dealing with subsets of R. 


Lemma 4. Let F be a compact subset of R with positive Lebesgue measure. Then for 
any 8 > 0 there is an interval J such that \ F fl /| > (1 — 8)\J\. 

Proof. (This follows immediately from Lebesgue’s theorem on points of density, but is 
actually more elementary.) By (1) and the compactness assumption, F can be covered 
by finitely many open intervals {/,■} such that 
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X)i/ i i<a-«r 1 iFi. 

i 

If | F fl I( | < (1—6)17/1 held for all i , then summing over i would yield a contradiction. 


In the following three lemmas we consider the set 


K = K({d u ...,d m }) = 



. & n £= {d \ , . . . , d m } / , 


(9) 


where d\,di, . . . ,d m are real numbers (not necessarily distinct). Thus, K is a self- 
similar subset of R given by (5), but of the special form in which the contraction ratio r 
is equal to m 1 . the reciprocal of the number of digits. In the proof of Proposition 3 
we will have K = P e ( A) for a fixed 0; then dj = P e fb,). 


Lemma 5. For K as in (9), let K t = d t + m~ l K. Then K = (JJli K, and \ K, n Kj \ = 
0 for i £ j. 


Proof. The first statement follows immediately from the definition of K. The second 
statement is an easy consequence of self-similarity. The set K is a union of m pieces, 
each of which is a translate of m~ 1 K. Since | AT| = m ■ K\. the pieces have to be 

pairwise disjoint in measure. The formal proof goes as follows: Fix 1 < i < j < m. 
We have 


IATI = 


IK 

k=l 


<\K t UKj\+ £ \K k \ = - \KiDKjl. 

k<m\ k^i,j 


k = 1 


But |^| = \d k + m 1 K\ = m 1 1 K |, hence \K t n Kf < 0. 


Observe that the lemma covers the case d t = dj as well. Then K t fl Kj = K t , but 
| AT | = \Ki\ = 0, so there are no surprises here. 

The previous lemma says that the pairwise intersections AT, (T K, cannot be “large.” 
However, at least one of them must be nonempty. 


Lemma 6. Let K and AT,- be as in Lemma 5. There exist indices i and j, i f j, such 
that Kj n Kj f 0. 


Proof. Let N E (F) = {x : distfx, F) < e) denote the neighborhood of radius £ of a 
set F. Suppose that the sets K k are pairwise disjoint. Since they are compact, the 
distance between any two of them is positive. Thus we can find £ > 0 so that N e (K,) n 
N e (Kj) = 0 whenever i f j . Then we have 


\N S (K)\ = 


U 


k=\ 


= 'f^\N e (K k )\. 


k= 1 


On the other hand, 

= \N e {d k +m~ x K)\ = \N e (m~ l K)\=rn- l \N me {K)\, 
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since mN s (m~ l K) = N me (K). We conclude that | A^(AT)| = \N me (K)\. This is a con- 
tradiction, for both N e (K ) and N me (K ) are bounded open sets, and N S (K ) is obviously 
a proper subset of N me (K). ■ 

Before stating the next lemma we need to introduce some notation. Recall that the 
self-similar set K has a representation 

m m 

K = {jK l =\J(d i +m'K). 

i = 1 i—\ 

Substituting this formula into each term in its right-hand side, we obtain 

m 

K = U K tJ , K u = di + m~'dj + m~ 2 K. 

i,j = 1 

The sets K t and K t j are called the cylinder sets of K of orders 1 and 2, respectively. 
This operation can be iterated. For each positive integer l the set K is the union of 
pieces, called cylinders of order I, each of which is a translate of m~ l K. Let A = 
{1,2,..., m) and A 1 = {u = u \ . . . : u t e A}. Then 

i 

K = U K u , K u = K ui ... um = J2 d » n m ~ n+1 +rn~ l K. 

ueA e "=1 

Repeating the proof of Lemma 5 for this decomposition demonstrates that 

\K u DK v \=0 (10) 


for different u and u in A 1 . 

We want to understand when \ K\ =0. There is an easy sufficient condition: | K \ = 0 
if two cylinders of K coincide; i.e., if K u = K v for some distinct u and v in A £ . This 
can be seen in many ways; for instance, \K\ = m l \K u \ = m l \K u Pi K v \ = 0 by (10). 
This condition is too strong to be necessary, however: for the planar self-similar set A 
in (8), there are just countably many 0 in [0, n) for which A° = P e { A) has two coin- 
ciding cylinders. Thus we would like to know what happens if some cylinders “almost” 
coincide. 

Definition 7. Two cylinders K u and K v are e -relatively close if u and v belong to A e 
for some i and K u = K v + x for some x with |jc| < £ • diam (K u ). 

Note that diam(if M ) = diam(^T u ) = m~ l diam(^f) for all u and v in A 1 . Let 

i 

d u = 

n = 1 

so that K u = d u + m~ l K. Then K u and K v are ^-relatively close whenever 

\du-d v \ (11) 

Lemma 8. If for every e > 0 there exist an index t and distinct u and v in A 1 such 
that K u and K v are e-relatively close, then \K\ =0. 
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This lemma (and its converse, which holds as well) is a very special case of a theo- 
rem by Bandt and Graf [1]. 

Proof. Suppose, to the contrary, that |jRT| > 0. Then we can find an interval J such 
that \ J D K\ > 0.9|/| (Lemma 4). Let s = |/|/(2diam(jRT)). By assumption, there 
exist an index i in N and distinct u and v in A 1 such that the cylinders K u and K v are 
^-relatively close. If J u = d u + m~ l J and J v = d v +m~ l J , then 

Ju Jv (d u dy) 


and 


\d u — d v \ < sm ^diam(A r ) = 0.5|/ tt |. 

This means that J u and J v have a large overlap — at least half of J u lies in J v . Since 
J was chosen to ensure that at least 90 percent of its length belongs to the set K , this 
property carries over to J u and K u . To be more precise, 

| J U DK U \ = \(d u +m~ e J) H (du+m~ l K)\ = m~ l \J Pi K\ > 0.9m“ € |/| =0.9|/J. 
Similarly, 


\J v nK v \>0.9\J v \. 

Since at least 90 percent of J v is in K v and at least 50 percent of J u is in J v , we find 
that at least 40 percent of J u is in K v . But at least 90 percent of J u is in K u , so at least 
30 percent of J u is in K u {^ K v . This is in contradiction with (10), and the lemma is 
proved. ■ 

Proof of Proposition 3. Recall that A is a planar Cantor set of description (8). If we 
again write A 6 = P e ( A), then 


A 0 = \J(Pe(b i ) + m ~ 1 A 0 ), 

1 = 1 

so all the foregoing discussion (in particular, Lemma 8) applies to A°. Let 

V e \= |^J{0 e [0, 7 r) : there exist distinct u and v in A e 

i> i 

with A e u and A° v ^-relatively close}. 

Note that if 9 lies in f\>o then \A°\ = 0 by Lemma 8. The proposition will be 
proved if we are able to show that |[0, n) \ n e>0 V fi | = 0. But n £>0 H Vi /», 

and by DeMorgan’s law, 


[0, n) \ P| V 1 /n = U([°, 7T> \ Vi/„). 

n = 1 n = 1 

Thus, it remains to verify that 

I [0, n) \ V E | = 0 


( 12 ) 


for a fixed e. 
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We would like to show that any subinterval of [0, n) has at least a fixed percentage 
of its length (depending on s but not on the subinterval’s size) lying in V e . In view 
of Lemma 4, this will imply (12). The desired conclusion will follow if we can find 
positive constants C\ and C 2 such that for any 0 in (0, n) and any t in N there is a 0o 
satisfying 


1 0 — 0o I < C\m l , (0o — Ctfm l , 0 O + C^sm l ) C V £ . (13) 


(There is a minor technical issue when the interval in (13) is not contained in [0, n), 
but it is easy to handle.) 

We fix 0 in (0, n) and l in N. Appealing to Lemma 6, we choose i and j with i ^ j 
for which Af fl A® ^ 0. Since 

a?= U a;, a‘= ij a;, 

ueA^, u\—i veA^, v\=j 


there exist u and v in with u\ = i and = j , such that A e u n A e v ^ 0. This means 
that there are points y u in A u and z v in A v such that Pe(y u ) = Pe(z v )• Denote by 
z u the point in A u corresponding to z v (i.e., A v — z v = A M — z u \ and let 0 O be the 
angle such that Pe 0 (z u ) = Pe 0 (z v ), whence A e u ° = A 6 V °. Then |0 — 0ol is the angle at z v 
for the triangle with vertices z u * z v , and y u (see Figure 4), and therefore |z M — y M | > 
\y u — z v \ sin |0 — 0 O |. This implies that 


sin |0 - 0 O | < 


diam(A M ) 
dist(A M , A v ) 



Note that diam(A M ) = m~ l diam(A) and dist(A M , A v ) > dist(A, , A ; ) > 8 > 0 for 
some 8 = 8(A) > 0 by the hypothesis of pairwise disjointness in Proposition 3. Thus 
the first condition in (13) holds with the constant C\ — tt diam(A)/(2<5). By the choice 
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of Oq we have Oq in Vo, which is a subset of V £ . Using (7), we conclude that 


I (Pa ~ Pd 0 )(Z - W)| < \0t ~ Oq | • |Z - W| 


for any two vectors z and w and any a e [0, n). Consequently, the projected set A" 
can be obtained from A" by a translation of at most \a — 0 q\ • diam(A). On the other 
hand, the diameter of A" is at least • width(A), where width(A) signifies the 
minimal width of a strip that contains A; it is nonzero because the assumption that 
the sets A; are disjoint prevents A from being contained in a straight line (recall 
Lemma 6). Set C 2 = width(A)/ diam(A). Then by Definition 7 the cylinders A a u and 
A" are ^-relatively close for all a in the C 2 £m"U neighborhood of Oq. It follows that 
this neighborhood lies in V e , so the second condition in (13) is verified. This completes 
the proof. ■ 

3. CONCLUDING REMARKS. First we explain what we meant by “quantitative 
estimates” alluded to after the statement of Theorem 2. Recall that the Favard length 
of a planar set F is defined by Fav(F) = f* \P 0 (F)\dO. Denote by K the Cantor 
middle-half set. By Proposition 3, Fav(^ 2 ) = 0. Now consider the nth stage of the 
Cantor set construction for K : 

P a k 4“* : a k e {0, 3} for 1 < k < n and a k G {0, 1 , 2, 3} for k > n | . 

Then K 2 is a union of A n squares of side-length 4“\ Clearly, Fav(^ 2 ) = 0 implies that 
lim^oo Fav(^f 2 ) = 0, but it does not yield an estimate for the rate of convergence. A 
lower bound Fa v(A^) > c/n for some c > 0 follows from Theorem 4.1 in [14]. The 
elementary method described in the present article can be refined to yield a quantitative 
upper bound [17]. For y > 1 we employ the notation 


log* y = min 


n > 0 : log log ... log y < 1 

n 


(14) 


Theorem 9 (Peres and Solomyak). There exist positive constants C and a such that 

Fav(^) < C exp[— a log* n] 


for all n in N. 

The convergence of the upper bound in Theorem 9 to zero is extremely slow, but it is 
the best estimate currently known. It is a challenging unsolved problem to determine 
the correct asymptotics. We would guess that the lower bound c/n is closer to the truth. 

We make a few additional comments. 

• It follows from results of Kenyon [12] and Lagarias and Wang [13] that \ P e (K 2 )\ = 
0 for all 0 such that tan 0 is irrational. (This is, of course, much stronger than Propo- 
sition 3!) However, this information does not seem helpful in obtaining an upper 
bound for Fav(^T 2 ). 

• Using polar duality, one can infer from the properties of the four-comer set K 2 the 
existence of Kakeya sets , i.e., compact planar sets of zero area that contain line 
segments in every direction. This connection was discovered by Besicovitch [4]; see 
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also Besicovitch [ 2 ], [ 3 ] and Kahane [ 11 ]. Note that the latter paper does not really 
prove that almost every projection of K 2 has zero length. 

• The approach of this paper was originally developed in [ 16 ] to study the Hausdorff 
measure of certain parameterized families of fractals. 
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